AD-A044  574 


UNCLASSIFIED 


\^\ 

*844574 


SEOFSE  MASHINSTON  UNIV  WASHINSTON  D C INST  FOR  MANAO— ETC  F/§  12/1 
AN  IDEALIZED  EXACT  PENALTY  FUNCTION. (U) 

JUL  77  • P MCCORNICK  AFOSR-73-2504 

SER1AL*T-35«  AF0SR-TR-77-1S1S  NL 


END 

DATE 

FILMED 


10-77 

ODC 


DOC  FILE  COPYyv  . AD  A 044574 


1 


AFOSK-TR-  7 7 - 12  13 


THE 

GEORGE 

WASHINGTON 

UNIVERSITY 


STUDENTS  FACULTY  STUDY  R 
ESEARCH  DEVELOPMENT  FUT 
URE  CAREER  CREATIVITY  CC 
MMUNITY  LEADERSHIP  TECh 
NOLOGY  FRONTiyllfeSIGN 
ENGINEERING  APP|^^MENC 
GEORGE  WASHIsM^tMLNIN 


«istrlbotlon  imllalted. 


INSIIUIIf  lOK  MANAt.lMlNI 
S(  I1N(  I AND  IN(.INIIKIN(, 

S(  I K X )|  ( )l  I N(  ,IM  I Kl\( , 
\M)  M’lMIID  M II  N<  I 


THIS  IXKUMfSr  HAS  »f[N  APPROVED  TOR  PUSIIC  REIEASE  AND  SAll,  ITS  DISTRIBUTION  IS  UNIIMITED 


AN  IDEALIZED  EXACT  PENALTY  FUNCTION 
by 

Garth  P.  McCormick 


Serial  T-359 
U July  1977 


The  George  Washington  University 
School  of  Engineering  and  Applied  Science 
Institute  fcr  Management  Science  and  Engineering 


Research  Sponsored  by 
Air  For;e  Office  of  Scientific  Research 
Air  Force  Systems  Command,  USAF 
under  AFOSR  Contract  73-2504 


This  document  has  been  approved  for  public 
sale  and  release;  its  distribution  is  unlimited. 


THE  GEORGE  WASHINGTON  UNIVERSITY 
School  of  Engineering  and  Applied  Science 
Institute  for  Management  Science  and  Engineering 


I 

I 


• Abstract 

: of 

i Serial  T-359 

I 14  July  1977 

I 


AN  IDEALIZED  EXACT  PENALTY  FUNCTION 
by 

Garth  P.  McCormick 


In  this  paper  an  idealized  exact  penalty  function  is  derived  from 
natural  considerations  of  the  flow  of  particles  under  different  forces. 

It  is  shovm  how  Fletcher's  exact  penalty  function  is  an  approximation  to 
this  one.  A second  order  version  of  the  idealized  exact  penalty  function 
Is  developed  which  is  computable. 


I J 


•-*—  AU-lHOAtAX- 

Garth  P ymcCorm'i  ck 


Serial-T-359 


:/.j\  -^AFOSR-73-250^. 


9 PER*^0RMING0HGANIZAT|0N  NAME  ANO  ADDRESS 

The  George  Washington  University 
School  of  Engineering  & Applied  Science 
Washington,  DC  20052 

n CONTROLLING  OFFICE  NAME  AND  AOORESS 

Air  Force  Office  of  Scientific  Research/NM 
Bolling  AFB,  Washington,  DC  20332 


10  program  El  fment  project,  task 

AREA  « «ORj(-t>NIT 


61 I02F  I 

RBAURT  U*  I'R  • 

li)  Jul  77J 

NUMBER  OE  PAGES 


14  MONITORING  AGENCY  N AME  % A DO  R E SSr<f  f from  Confro/J/nu  Offi  re>;  15  SEC  U Rl  T V C L ASS.  f 0/  rh/ » reporr; 


I 


UNCLASSIFIED 

'iSn  ' DECL  ASSlPl’CATlON  OO'ANGRADING 

schedule 


16  distribution  statement  fol  thin  Report) 

Approved  for  public  release;  distribution  unlimited. 


17  Distribution  statement  (oI  the  mb  a tract  entered  In  Block  20.  it  dllterent  from  K*»porfJ 


I U supplementary  notes 


19  KEY  WORDS  fConffnu#  on  reverae  aide  It  neceeamry  and  iden*lfy  by  6/oclt  number) 


exact  penalty  function 
nonlinear  programming 


^0  ABSTRACT  fConllnue  r>o  reverae  aide  It  neceeamry  and  Identify  hy  block  ntimher) 

In  this  paper  an  idealized  exact  penalty  function  is  derived  from  natural 
considerations  of  the  flow  of  particles  under  different  forces.  It  is 
shown  how  Fletcher's  exact  penalty  function  is  an  approximation  to  this 
one,  A second  order  version  of  the  idealized  exact  penalty  function  is 
developed  which  is  computable. 


DD  1473 


EDITION  OF  I NOV  «S  IS  OBSOLETE 


UNCLASSIFIED 


VO  4 


security  Cl  ASSIFICATION  OF  TmS  PAUI  F 


THE  GEORGE  WASHINGTON  UNIVERSITY 
School  of  Engineering  and  Applied  Science 
InstltuLo  for  Management  Science  and  Engineering 
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by 

Garth  P.  McCormick 


1 . Introduction 

In  [1]  and  [2]  Hetcher  developed  an  exact  penalty  function  for 
constrained  optimization  problems.  That  is,  he  showed  how  one  unconstrained 
minimization  problem  could  yield  the  solution  of  a constrained  problem 
(In  a sense  to  be  defined  in  Section  2).  In  this  paper,  an  idealized  exact 
penalty  function  is  derived  from  considerations  of  the  movement  of  a par- 
ticle under  different  forces.  This  is  done  first  for  the  equality  constrained 
problem  and  then  generalized  to  the  inequality  constrained  case. 

The  ideal ized  exact  function  has  flow  lines  similar  to  those  observed 
for  the  particle.  It  is  shown  that  Fletcher's  exact  penalty  function  is  an 
approximation  to  the  idealized  one  near  constraint  boundaries.  A new  com- 
putable exact  penalty  function  which  uses  second  order  information  is  devel- 
oped which  provider  a better  approximation  to  the  idealized  one. 

2 . Movement  of  a Parti cle  Under  Different  Forces 

Consider  the  equality  constrained  nonlinear  programming  problem: 
minimize  f(x) 


subject  to  X E R » {x|h(x)  ■ 0} 


(1) 
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where  h(x)  Is  a p by  one  vector  of  functions  and  whera  x e e”  is 
an  n dimensional  vector. 

A physical  situation  which  models  this  is  one  in  which  the  particle 

in  e”  Is  acted  upon  by  forces  which  tend  to  decrease  the  function  f (x) 
and  which  also  tend  to  drive  the  particle  to  a point  satisfying  the  con- 
straints. Many  models  of  flow  result  depending  upon  the  relative  magnitudes 
of  the  force  lines.  In  Figure  1 is  one  situation  associated  with  the  problem 

minimize  - x + y 

(x.y)  (2) 

2 2 

subject  to  X + y = 1 . 


In  Figure  1 the  lines  of  force  off  the  perimeter  of  the  circle  are  entirely 
associated  with  driving,  the  particle  to  a feasible  point.  This  is  done  in 
a way  to  minimize  the  distance  between  the  particle  and  the  feasible  region. 
Once  on  perimeter,  the  lines  of  force  become  the  gradient  of  the  objective 
function  projected  onto  the  tangent  plane. 

An  exact  penalty  function  whose  gradient  vector  approximates  this 
flow  pattern  can  be  formulated  as: 

(f[Pr(x)l  if  X E R 

E(x)  = \ (3) 

|^(x)  if  X ^ R 

where  Pr(x)  is  the  projection  function  which  maps  any  point  x into  the 
'closest  feasible  point'  and  where  /^(x)  is  the  minimum  distance  from  x 
to  the  set  R given  by 

<&(x)  = min  II  X - y II  . (4) 

y E R 

The  major  difficulty  with  this  exact  penalty  function  is  that  neither 
It  nor  its  derivative  is  continuous.  Algorithms  for  minimizing  unconstrained 
functions  require  (usually)  that  the  first  derivatives  of  the  function  be 
continuous.  Furthermore,  following  these  flow  lines  is  not  necessarily  the 
quickest  way  to  solve  tlie  problem.  A better  strategy  would  be  to  anticipate 

- 2 - 
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the  flow  near  the  perlneter  and  create  a Line  of  flow  which  is  a combina- 
tion of  tlie  projected  gradient  on  the  perimeter  and  the  gradient  of  the 
squared  distance  function.  The  resulting  lines  of  flow  would  be  those  pic- 
tured in  Figure  2.  An  associated  natural  exact  penalty  function  would  be 

E(x)  = f[x-d(x)J  + qd(x)^d(x)  (5) 


where  x - d(x)  is  a i>olnt  solving  (4)  and  q is  a scalar  greater  than  0 
There  are  difficulties  with  this  definition  Involving  uniqueness.  Note  that 
d(x)  is  not  unique  in  example  (2)  when  (x,y)  = (0,0)  . Usually  the  vector 
d(x)  is  unique  but  if  it  is  not,  to  complete  the  definition,  the  following 
is  used: 


define  D(x/  = {djx-d  solves  (4)}  . 


Let  d(x)  be  a vector  from  D(x)  such  that 


f[x-d(x)]  (6) 

minimal.  This  specifies  the  value  of  the  function  E(x)  although  not 
necessarily  the  vector  d(x)  since  theoretically  there  can  b'i  more  than  one 
vector  in  D(x)  satisfying  the  criterion  (6). 

This  exact  penalty  function  (5)  combines  a penalty  associated  with 
being  away  from  the  fersible  region  and  the  value  of  the  objective  function 
at  the  closest  feasible  point.  The  differentiability  of  the  function  (5) 
depends  upon  the  differentiability  of  the  distance  function  d(x)  . 

Some  isovalue  contours  of  this  penalty  function  associated  with  problem 
(2)  when  q = 1 are  given  in  Figure  3.  Note  that  this  is  not  continuous  at 

(0,0)  , although  from  the  definition  E(0,0)  ^ - ■Pi  + (1/4)  since 

D(0,0)  = {d|||di|^=x}  and  d(0,0)  = {^Pzll.-Plll')  . 

The  equivalence  of  the  unconstrained  minimization  of  the  idealized 
exact  penalty  function  (5)  and  the  constrained  problem  (1)  is  summarized  in 
the  next  three  theorems. 


4 


. -.4 


FIGURE  3 ISOVALUE  CONTOURS  FOR 

EXACT  PENALTY  FUNCTION 
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T}ieorem_l . (Assune  f,  { } are  continuous.)  If  x Is  a local  uncon- 
strained minimi;cer  for  (5),  it  Is  a local  mlnlmlzer  for  (1). 

Proof : First  it  will  be  shovm  that  If  d(x)  ^ 0 , for  every  0 < r.  ^ 1 , 

E[x-ed(x)]  < E(x)  . (7) 

Obviously  d(x)  0 =>  d(x-ed(x)J  ^ 0 , since  otherwise  x - cd(x)  would  be 
a closer  feasible  point  to  x than  x - d(x)  . Also  note  that 

Ix-ed(x))  - (l-E)d(x)  c k , 


and  thus  by  definition 


I |d[x-ed(x) ] I I ^ (1-c) I ld(x) I I . 


(8) 


Also  by  definition. 


l|d(x)ll  <_  I lx  - { [x-£d(x)]-dtx-ed(x)] } 1 1 


(9) 


< c||d(x){]  + ] I d (x-Gd(x) ] I I (using  the  triangular 

inequality) 

(assuming  d(x)  is  not  proportional  to  d[x-ed(x)]  ) . Together  (8)  and  (9) 
imply  that  (l-f.)d(x)  is  the  single  element  in  D[x-ed(x)]  . Because  of  (6), 

f [x-ed (x)-d [x-ed (x) ] ] + q 1 ] d [x-rd (x) ] 1 j ^ « f[x-d(x)]  + q( 1-c) ^ | 1 d (x) | 1 ^ 

2 

< f[x-d(x))  + qlld(x)ll  (because  d(x)  ^ 0 assumed) 
This  completes  the  proof  of  (7). 


Because  x is  f local  unconstrained  mlnlmlzer  it  follows  from  (7) 
that  d(x)  = 0 and  hence  that  f(x)  = E(x)  . Furthermore,  there  is  a neigh- 
borhood N(x,6)  such  that  E(x)  ^ E(x)  for  all  x t N(x,6)  . Let 
X c R n N(x , 6)  . 
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Then 

f(x)  = E(x)  (d(x)  “ 0 for  X t R) 

^ E(x)  (since  x e N(x,6)  , 

= f(x) 

This  completes  the  proof  of  the  theorem. 

Theorem  1 is  important  in  that  it  states  that  use  of  the  Idealized 
exact  penalty  function  does  not  introduce  spurious  minimizers.  In  the  next 
theorem  the  converse  is  proved. 

^eorem  2.  Suppose  f,  {hj}  are  continuous  functions.  If  x is  a local 
minimizer  for  (1),  then  it  is  an  unconstrained  local  minimizer  for  (5). 

PjlPilL’  theorem  is  not  true,  there  is  an  infinite  sequence  of  points 

{x,  } where  x,  ^ x such  that 
k k 

f[x^-d(x^)I 

1 f [X|^-d(X|^)]  + qd(X|^)^d(Xj^)  = E(Xj^) 

< E(x)  - f(x)  , 

for  all  k , with  Xj^-d(Xj^)  e R . Since  x^  - x — ► 0 , d(Xj^)  — ► 0 , and 
therefore  Xj^  - d(Xj^)  x . This  contradicts  the  assumption  that  x is  a 
minimizer  for  x t R . 


Theorem  3. 


Suppose 


f,  {hj} 


are  continuous  functions. 


If  X is  a global 


unconstrained  minimizer  for  (5),  it  is  a global  minimizer  for  (1).  If  x* 
is  a global  minimizer  for  (1),  it  is  a global  unconstrained  minimizer  for  (5). 


I^roo^:  The  proof  is  obvious  .and  will  not  be  given. 


It  is  useful  to  examine  the  idealized  exact  penalty  function  in  terms 
of  its  derivative  and  its  Hessian  matrix.  These  results  will  be  useful  in 
develo[}ing  a computable  exact  pen.alty  function  in  the  next  section. 


8 - 
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Theorem  4.  Suppose  f , { } c C . Suppose  x Is  a point  »here  f'(x) 

has  full  row  rank  and  suppose  further  that  dfx)  is  unique  and  continuously 
Uf f erent iabl c in  a neighborhood  about  x . If  x is  an  isolated  unconstrained 
minimizer  for  the  idealized  penalty  function  (5),  i.e.,  if  E'(x)  = 0 and 

E"(x)  is  a positive  d 'finite  matrix,  then  x satisfies  the  second  order 
sufficiency  conditions  for  an  Isolated  local  minimizer  for  (1). 

Proof : Assume  for  the  moment  that  at  a point  x , d'(x)  and  d"(x)  are 

defined.  Using  the  chain  rule  of  differentiation, 

E'(x)  = f’[x-d(x)](l-d(x)l  + 2q  d(x)^d’(x)  , (10) 

and 

E"(x)  = {f '[x-d(x)l  ® I^}  {-d"(x)}  + [I-d'(x)I  f"{x-d(x)]  fl-d’(x)] 

(11) 

+ 2q  d'(x)^d'(x)  + 2q[d(x)  ® l^)d"(x)  . 

Consider  any  point  x near  x • Let  z*(x)  solve  the  problem 

2 

minimize  | | z-x | | subject  to  h(z)  « 0 . 
z 

Note  that  d(x)  = x-  z*(x)  . For  x close  enough  to  x , clearly  z*(x)  is 
close  to  X and  the  matrix  f'[z*(x)l  has  rank  p . Therefore,  the  first 
order  necessary  conditions  apply  and 

[z*(x)-x]  2 + h'[z*(x)I^  u*(x)  •=  0 , 

where 

u*(x)  = - Ui' [ z*(x) 1 h' (z*(x) ]^}  ^h'[z*(x)l  (x-z*(x)]2  . 

Another  way  of  writing  this  is 

P(x)d(x)  - 0 (12) 

where 

P(x)  - I I-h’lzMx)]’^{h’tz*(x)]h'[z*(x)l'^}“^h'[z*(x)]i  . 
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Also , 


h[x-d(x)]  = hfz*(x)]  » 0 . 


Di f f erent.  in  t iiiR  (13)  yields 


h'[x-d(x)]  (I-d'(x)]  * 0 


Dif ferentiiiting  (12)  yields 


P(x)d’(x)  + [I  (55  d(x)^]i '(x)  » 0 
n 


(13) 


(U) 


(15) 


When  d(x)  = 0 , (14)  implies  that 

h'(x)  » h'(x)d’(x) 
and  using  this  in  (15)  yields 

d'(:.)  = h’(x){h'(x)h'(x)'''}h'(x)’’  . (16) 

Differentiation  of  (14)  directly  yields 

{h’(z*(x)]  05  ) (-d"(x)}  + {I-d’(x)}hJ[z*(x)]{I-d'(x)}  - 0 . (17) 

From  Theorem  2 it  is  knovm  that  d(x)  - 0 (and  therefore  that  h(x)  * 0 ) . 
Thus,  formula  (16)  can  be  used. 

Then  E'(x)  = 0 , implies,  using  (10)  and  (16),  that  the  first  order 
necessary  conditions  are  satisfied  at  x for  a constrained  minlmizer.  The 
appropriate  Lagrange  multipliers  are  given  by  the  formula 

u(x)^  = f ' (x)h' (x)^{h' (x)h' (x)^}  ^ . 

Thus , 

f'(x)  - u(7)h’(x)  . (18) 


The  first  term  In  (11)  can  be  replaced  using  (17)  and  (18)  and  summing 
appropriately  with 

- (I-d'(x))  [{u(x)'^  © I^}h"(x)]  [l-d'(x)l  . 


- 10  - 
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UsiiiR  this,  then, 

E"(x)  = P(x)|f"(x)-{u(x)”’^(5?  I^}h"(x)]p(x)  + 2qh’(x)'^[h'(-T)h’(x)'^]~n,'(7)  . (1 

Because  E"(x)  was  assumed  positive  definite,  it  f o’ lows  that 

z^P(x) [f"(x)-{u(x)^  ® I^}h"(x)] P(x)z  = z^[f"(x)  -{u(x)^(^  I^}h"(x)|z  > 0 

for  all  z where  h'(x)z  = 0 . Thus  the  second  order  sufficiency  conditions 
are  satisfied  at  x . 

3 . Fletcher's  Exact  Penalty  Function  (Equality  Case) 

Tn  [1],  Fletcher  proposed  an  exact  penalty  function  (with  variations) 
for  the  equality  constrained  optimization  problem  (1).  The  variation  closest 
to  the  natural  idealized  function  developed  in  (5)  is 

F(x)  = f (x)  - f’(.<)  h’(x)  h(x)  + qh(x)^  h'(x)  ^ h'(x)  h(x) 

where  h'(x)  is  the  Ponrose-Moore  generalized  inverse  of  h'(x)  , 

.1. 

n derivative  matrix  of  h(x)  . In  general,  for  a matrix  A , A 
unique  matrix  satisfying 

AA  A = A 

4.  4.  4. 

A AA  » A 

4.  »r  4. 

(AA  ) = AA 

(A  A)^  - a"^A  . 

Let  z*(x)  denote  a solution  of  the  minimum  distance  problem: 

2 

minimize  | | x-z  | | 
z 

subject  to  h(z)  » 0 . 


(20) 

the  p by 
is  the 


-11  - 
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A first  order  Vaylor's  scries  approximation  yields 
0 = h[z*(x)l  = h(x)  + h' (x) (z*(x)-x J , 

The  solution  to  this  approximation  with  miniumum  norm  is 
- d(x)  = z*(x)-x  * - h'(x)  h(x)  . 

Substituting  this  in  the  idealized  exact  penalty  function  (5)  using  the 
approximation  f[x-d(x)]  = f (x)  - f'(x)d(x)  yields  (20)  above  directly. 

Viewed  from  this  point  of  view,  Fletcher's  exact  penalty  function 
for  equality  constraints  is  a first-order  approximation  to  the  idealized 
penalty  function  (5).  , 

Fletcher  was  able  to  show  that  if  a point  x*  satisfied  the 
second  order  sufficiency  conditions  associated  with  (1),  it  was  an  Isolated 
local  unconstrained  minimlzer  of  the  penalty  function  (10)  for  a large  enough 
value  of  q . This  is  a weaker  theorem  than  Theorem  2 which  made  that  state- 
ment for  any  value  of  q . More  important , there  is  no  corresponding  theorem 
for  (20)  analogous  to  Theorem  1.  That  is,  the  question  of  whether  or  not  (20) 
has  local  unconstrained  minlmlzers  which  are  not  local  mlnimizers  for  the 
equality  constrained  problem  (1)  was  not  resolved.  A partial  resolution  of 
this  question  can  be  obtained  by  making  a more  precise  approximation  to 
_ z*(x)  . 

I Using  a second  order  approximation, 

- d(x)  = z*(x)  - X = - h*(x)  fh(x)  + y y(x'>1 
where  y(x)  is  a p by  1 vector  whose  jth  component  is 
Yj(x)  = h(x)^  h'(x)  ^ hj(x)  h'(x)  h(x)  . 

Using  tlie  second  order  approximation 

f(x-d(x)l  » f(x)  - f'(x)d(x)  + Y d(x)^f"(x)d(x) 
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and  substituting  tlic  d(x)  above  In  (5)  yields  (throwing  away  terms  beyond 

j 

: quadratic  In  h(x)) 

M(x)  «=  f(x)  - f'(x)  h'(x)  h(x)  - ^ f'(x)h'(x)'  y(x) 

; + h(x)^  h'(x)  ^ f"(x)  h’(x)'  h(x)  (21) 

t 

i + (jh(x)h'(x)  ^ h'(x)  h'(x)  . 

T 

When  at  x the  derivatives  (hj(x)  } , j * are  linearly 

Independent,  then 

h'(x)'  = h’(x)'^  {h'(x)h’(x)’'}“^  . 

The  quantity 

u(x)  - h’(x)"^ 

Is  an  estimate  of  the  Lagrange  multipliers  usually  associated  with  a local 
I ralnimizer  of  (1).  In  order  to  differentiate  M(x)  is  it  necessary  to  obtain 

I u'(x)  . V’hen  h'(x)  has  full  row  rank,  it  can  be  shown  that 


u'U)  . Ih'(,)h’<x)^)-*  L'p®  if(»)->.feA'<x)i)  h"(x)] 


(22) 


+ h'(x)  '’^[f"(x)-{u(x)'^  ® I^}  h”(x))  . 


Assume  tliat  at  a point  x , h'(x)  has  full  row  rank  and  that  h(x)  » 0 
Using  (22),  then  M'(x)  and  M"(x)  are 


and 


M'(x)  - f'(x)  - u(x)^h'(x)  , 

M"(x)  = - b'(x)'^[:ip®  [f ’(x)-u(x)’’'h'(x)l}  h''(x)] 

T 

- j^lp  ® (f  (x)-u(x)\'(x)}h”(x)j  h"(x) 


yj 


+ h’(x)'^[h'(x)  h>(x)'’^]  h'(x)2q 

+ P(x)[^f"(x)-{u(x)'‘’ ® h"(x)j  P(x) 


- n 


(23) 


(2A) 
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wliere 


P(x)  = I^I-h’(x) ‘h’(x> 


j^I-h’(x)'^{h’(x)h'(x)’^}”^ 


h’(x 


>] 


Theorem  5. 


Suppose 


f,  {hj>  c 


Let  X be  an  unconstrained  local 


minimizer  for  the  exact  penalty  function  M(x)  given  by  (21).  If  h(x)  = 0 , 
then  X is  a constrained  minimizer  for  problem  (1). 


Proof:  Let  x •.  R le  any  point  "close  to"  x . 

Then 


f(x)  = M(x) 
^ M(x) 
==  f(x) 


(x  c R and  therefore  h(x)  = 0) 

(x  is  a local  unconstrained  minimizer) 
(x  K R and  therefore  h(x)  » 0)  . 


The  same  theorem  under  the  same  assumptions  can  be  proved  for  Fletcher's 
penalty  function  (20). 


Theorem  6.  Suppose  f , { h j } c C . Let  x be  an  unconstrained  local 

minimizer  for  tl»e  exact  penalty  function  F(x)  given  by  (20).  If  h(x)  = 0 , 
then  X is  a local  minimizer  for  the  constrained  problem  (1). 

Proof : The  proof  is  identical  to  that  of  Theorem  5. 

The  difference  I etween  these  approximations  and  the  idealized  exact 
penalty  function  is  that  they  do  not  guarantee,  except  in  special  cases,  that 
the  penalty  function  is  the  value  of  the  objective  function  at  the  closest 
feasible  point,  plus  some  weight  of  the  squared  distance  to  that  point.  When 
the  constraints  are  linear  and  the  objective  function  quadratic,  M(x)  does 
have  this  property  and,  therefore,  one  can  show  that  unconstrained  local  mlnl- 
mizers  in  this  instance  are  constrained  local  mlnlmizers  (actually  global 
minlmizers) . Since  F(x)  is  a first-order  approximation  to  (5)  and  M(x) 
is  a second-order  approximation,  one  would  expect  that  examples  where  spurious 
local  unconstrained  minjmizers  exist  to  the  former  which  do  not  correspond 
to  constrained  minlmizers  of  (1)  would  be  more  unlikely  to  occur  for  M(x)  . 

At  present  there  arc  no  examples  of  this. 
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From  an  al  Ror  1 1 lim Ic  point  of  view,  if  In  attempting  to  minimize 
eitlier  F(x)  or  M(x)  , the  sequence  of  points  generated  is  not  tending 
toward  feaslblity.  it  would  be  no  great  trick  to  modify  the  algorltiim  to 
obtain  points  closer  to  feasibility  and  then  retry  to  minimize  the  exact 
penalty  functions  later. 

The  next  theorem  shows  another  sense  in  which  H(x)  is  closer  to  the 
idealized  exact  penalty  function.  The  Hessian  matrix  of  M(x)  at  uncon- 
strained local  minimizers  for  (1)  agrees  with  that  of  E(x)  . This  is  not 
the  case  for  F(x)  wliich  is  why  the  value  of  q Is  Impoi tant  for  showing 
that  strict  loc.il  minimizers  of  (1)  are  isolated  unconstrained  minimizers 
for  F(x)  . 

3 — — 

7.  Suppose  f , { h^ } e C . Suppose  x is  a point  where  f'(x) 

has  full  row  rank,  and  suppose  further  that  x satisfies  the  second-order 
sufficiency  conditions  for  a strict  Isolated  local  minimizer.  Then  x 
is  an  isolated  unconstrained  local  unconstrained  minimizer  for  M(x)  as 
given  by  (21)  for  any  value  of  q > 0 . 

Proof : Because  x is  feasible,  h(x)  •=  0 , and  (23)  and  (24)  are  applicable. 

From  the  first-order  optimality  conditions  for  a constrained  minimizer,  it 
is  known  that 

f(x)  - u(x)’^h'(x)  - 0 , (25) 

therefore,  M'(x)  = 0 . 

Because  of  (25),  E"(x)  takes  the  form 

P(x)[f"(x)  - {u(x)^  ® l^^h"(x)]p(x)  F 2qh'(x)^{h'(x)h’(x)’’}  h’(x)  . 

The  second  order  sufficiency  conditions  imply  that  this  is  positive  definite 
for  every  q > 0 and  therefore  that  x is  an  Isolated  unconstrained  local 
minimizer  for  (21). 
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4 . 1 nequa IJ  ty  Constrained  Problem 


Tlie  inequality  ~onst rained 
minimize  f (x) 

X 

subK'tt  to  Rj^(x) 


problem  can  be  written: 


> 0 , for  1 • I , . . . ,m  . 


Define  R = {x|gj(x)  ^ 0 , 

An  example  of  t'uis  problem 

minimize 

(x.y) 

subject  to 


i - 1, . . . ,m(  . 

is  a modification  of  Problem  (2): 
- X + y 

2 2 ^ , 

-X  - y +1^0. 


(26) 


(27) 


Lines  of  flow  associated  with  this  problem  can  take  forms  similar  to 

the  equality  constrained  Problem  (2)  with  the  major  exception  that  the  interior 

of  the  circle  is  also  now  feasible.  This  means  (see  Figure  1)  the  flow  lines 

in  the  circle  which  are  above  the  line  - x + y = 0 are  no  longer  valid.  They 

T 

would  be  replaced  by  l<nes  parallel  to  the  negative  gradient  vector  (1,-1) 

In  Figure  two,  the  modified  lines  of  flow  would  probably  look  like  those  given 

in  Figure  4.  Here  the  lines  would  follow  the  negative  gradient  path  (In  the 

interior  of  the  circle)  until  the  boundary  is  "sensed"  and  then  would  curve  as 
a combination  of  the  projected  gradient  at  the  boundary  and  the  negative  gra- 
dient vector. 

A useful  way  to  view  this  modification  Is  to  decompose  the  negative 
gradient  of  f into  tvo  parts.  One  is  the  projection  of  the  negative  gradient 
on  the  direction  whlcli  tends  to  the  closest  boundary  point,  and  the  other 
that  which  is  the  difference  of  the  negative  gradient  and  the  projected  negative 
gradient.  This  decomposition  is  shown  In  Figure  5.  A natjcal  modification  of 
the  lines  of  flow  is  then  to  truncate  the  projected  negative  gradient  vector 
if  it  extends  beyond  the  feasible  region. 

This  analysis  of  the  lines  of  force  makes  clear  the  exact  penalty  func- 
tion to  be  used  In  circumstances  near  constraint  boundaries  where  the  objective 
function  has  a lesser  value  at  the  closest  point.  The  analysis  is  dependent  upon 
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scaling  of  the  ohiective  function  and  modifications  must  be  made  for  the 
circumstances  which  can  occur  here  but  not  in  the  equality  case.  There 
is  also  a serious  combinatorial  difficulty  which  arises.  Which  of  the  many 
subspaces  defines  the  exact  penalty  function?  In  short,  the  Inequality  con- 
strained problem  poses  many  more  difficulties  than  the  equality  constrained 
one. 


Let  M = {l,...,m}  . There  are  2'"  subsets  of  M . Denote  the 
ith  subset  by  . By  convention,  let  = {<j>}  . Let  t > 0 be  a pre- 

assigned number  which  defines  the  "close"  boundary  points. 

Also,  define  d = {y|g.(y)  =0,icI,yeR}.  / 

The  minimum  value  associated  with  is  - + » if  g^(x)  ^ t 

for  some  1 e , or  if  is  empty.  Otherwise  let 


min 

VeS 


X - y 


f. 


(29) 


For  any  point  x define  “ {d|x-d  solves  (29)  above}  . Usually 

there  is  only  one  vector  in  D^(x)  . Let  vector  from  Djj^(x) 

such  that 

f(x-d^(x)l  - inf^^^^^^^f[x-dl  . (30) 

Define  ° f[x-dj^(x)]  + qd^(x)  d^(x)  . 

The  exact  penalty  function  associated  with  Problem  (26)  is 

L(x)  = min  {E  (x)}  • (31) 

t-l 2"  ' 

This  exact  penalty  function  agrees  with  (5)  when  there  are  n9  Inequality 
constraints. 

Some  iso'value  contours  associated  with  Problem  (27)  are  plotted  in 
Figure  6.  Here  t * .9  . 

¥• 
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Theorems  follow  which  relate  the  local  mlnimlzers  of  the  exact  penalty 
function  to  those  of  Problem  (26). 


Theorom__8.  Supi)ose  f , {g^}  are  continuous.  If  x is  a local  unconstrained 

minimizer  for  the  exacr  penalty  function  (31),  then  it  is  a local  minimlzer 
for  Problem  (26'i. 

Pj^oof ; Suppose  x is  a local  minimizer  for  (31).  Let  H be  any  index  de- 
fining E(x)  (usually  there  is  only  one  such  index).  Let  some 

vector  from  fo"  which  E^^(x)  is  defined  in  (30). 

Let  £ be  such  that  0 < c < 1 . Since  x - ed^(x)  - (l-e)d^(x)  c R , 

it  follows  from  the  definition  of  dj^[x-td^(x)  ] that 

II  d^[x-edjj(x)]  II  1 (1-e)  11  d^(x)  1|  . (32) 


Also  by  definition 

II  d^(x)  II  < II  X - |fx-cdj^(x)]  - d^fx-ed|^(x>]|  j| 


(33) 


l|  d (x)  11  + 1!  d [x-ed  (x)l  11  • (from  the  triangular 

inequal ity) 

The  last  inequality  is  strict  unless  d^(x)  is  proportional  to 
dj^[x-tdj^(x)  ] . Together  (32)  and  (33)  imply  that 


‘<j^[x-£:d^(x)  1 II  = (1-c)  ||d^(x)  ||  . 

Thus,  (l-f-)dp(x)  E Dp[x-cd^(x)l  . (34) 

Because  of  (30). 

f fx-td|j  (x)-dj  lx  -(.dj(x)})  ^ f {x-rdj  (x)-(l-f  )dj  (x)  ) - ffx-dj^(x)]  . (35) 
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Thus, 


E(x-ccij{x)l  < Ej  (x-cd^  (x)  ] 

;^f[x-d^(x)l  + (l-e)^d^^(x)dj^(x)q 
^ f[x-dj^(y)]  + d^(x)djj^(x)q 

= E(x)  . 


((31)) 

((34)  & (35)) 
(36) 


It  d|(x)  ^ 0,  strict  '.nequality  holds  in  (36).  Since  x is  an  assumed 
local  unconstrained  minlmizer  for  E , it  follows  that  E[x-edj(x)]  ^ E(x) 
for  t.  near  one  and  therefore  that  ” 0 . Thus,  x e R , and  also, 

E(x)  = E|^(x)  - f(x)  . (37) 

Now,  for  any  x c R , Ej^(x)  = f(x)  . Because  x is  an  unconstrained 
local  minimizer  for  E , there  is  a neighborhood  N(x,6)  such  that 
E(x)  > E(x)  for  X L N(x,6)  . Let  x e N(x,6)  O R . Hence, 


f(x)  = Ej(x) 
i E(x) 
i E(x) 

= f(x) 


(x  f.  R) 
((31)) 

(x  c N(x,6)) 
((37))  . 


Q . E . D . 


Tlie  Importance  of  this  theorem  Is  that  use  of  the  exact  penalty  function 
does  not  Introduce  spurious  candidates  for  local  minlmizers  of  the  inequality 
constrained  problem.  The  converse  of  this  tlieorem  is  not  true  without  modifi- 
cation; but  this  is  not  a strious  computational  problem.  There  are  two  partial 
converse  theorems. 

Theorem  9.  Suppose  t , {g^}  are  continuous  functions.  Suppose  x Is  some 
local  minimizer  for  Problem  (26).  Then  there  exists  a value  q > 0 such 
tliat  for  r|  > q , X Is  a local  unconstrained  minimizer  for  (31). 
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( 


!’roof:  Assume  the  coiiurary,  that  there  is  a seqiience  where  -*•  x 

such  that 

i:(x  ) ^ E(x)  < f(x) 
k — 

for  all  !<  . Let  ?-(k)  be  an  Index  defining  • 

Case  ( 1 ) . Without  loss  of  generality  assume  that  x c ^j(k) 

all  k . It  is  clear  that  for  any  value  of  q > 0 , 
the  proof  of  Theorem  2 applies  and  a contradiction 
resui ts . 


Case  . Assume  without  loss  of  generality  that  x i 

for  all  k . This  means  that  there  Is  an  index  i 
such  that  (k) ^ ° ^ ^ 0 • 

Thus , 


1 inlnf 
k— >0“ 


'^i(k) 


Let  v^  = liminf  ^ f^k~^J!,(k)  ^’‘k^  ^ ‘ 
k— ><“ 

Then,  take  q to  be  any  value  such  that 
q > [ f (x)-Vj ] /v^  . 

It  then  follows  directly  that  for  k large 
E(Xj^)  > f(x)  , 

a contradiction.  Q.E.D. 

Theorem  10.  Suppose  f , {g^}  , {h^}  are  continuous  functions.  Suppose  x 

is  some  local  minimizer  for  Problem  (26).  Then  there  exists  a value  t > 0 
such  that  for  all  0 < ' t*  , used  in  (28)  x is  a local  unconstrained 
minimlzer  for  (31)  for  every  q > 0 . 

Prool : It  t usi'd  in  del inlng  is  small  enough,  then  there  is  a neighborhood 

about  X such  that  for  every  x that  neighborhood,  x f where  l{x) 

is  an  index  defining  E(x)  . Thus,  the  Case  (i)  proof  of  Theorem  9 applies. 

o.P.n. 
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b.  Klftrher's  *''xact  Penalty  Function  (Inequality  Case) 

In  [2]  Fletcher  extended  his  penalty  function  to  the  case  when  the 
problem  is  written  with  inequality  constraints  (26).  As  indicated  in  the 
development  of  the  idealized  exact  penalty  function  in  Section  4,  there  is 
a basic  combiniitor iaJ  i)roblem  that  is  not  associated  with  the  equality  con- 
strained problem.  The  problem  is  to  determine,  at  any  point  x for  which 
thi’  penalty  function  is  to  be  defined,  which  of  the  many  constraints  define 
the  exact  penalty  function.  The  method  given  in  Section  4 is  theoretically 
valid,  but  possibly  computationally  prohibitive.  Fletcher  resolved  this 
problem  in  the  following  way. 

Cimsider  the  quadratic  optimization  problem: 

for  any  point  x , 

minimize 
6 

subject  to 

Let  u(x,q)  be  a set  of  Kuhn-Tucker-Karush  multipliers  associated 
with  the  solution  of  the  problem  and  let  6(x,q)  denote  the  solution  point. 
Since  the  objective  function  is  strictly  convex,  the  solution  point  (if  one 
exists)  is  unique  although,  in  general,  the  multipliers  are  not.  The  exact 
penalty  function  at  x is,  then, 

R(x)  = r (x)  - u(x,q)^g(x)  . (39) 

When  the  multiplier  vector  u(x,q)  is  not  unique,  the  definition  is  ambiguous. 
Furthermore  there  are  questions  of  the  continuity  of  (39)  and  its  levels  of 
d i f f ereni lab i 1 i I y . These  are  taken  up  by  Fletcher  and  under  certain  regularity 
assumptions  he  proves  theorems  concerning  these  matters. 

To  show  that  (39)  is  an  approximation  to  (31),  consider  the  following. 
Define  !/(x)  to  be  tlie  set  of  indices  such  that  g^(x)  + gj(x)  6(x,q)  = 0 , 


q + f'(x)6 


g(x)  + g' (x)6  > 0 


(38) 
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i.e..  t lu'  constraints  active  or  binding,  at  the  solution  of  (38).  Ttien  tlu* 
usual  rt'Kii  J ar  i t y assumption  is  that  tlu-  vectors 

*}>!(x)‘  , lor  i 
are  linearly  independent. 

When  this  regularity  assumption  is  satisfied,  the  multiplier  vector 
u(x,<i)  is  unique.  The  components  associated  with  the  constraints  whose 
indices  are  not  in  tc(x)  are  equal  to  zero.  Let  ]f(x)  denote  the  vector 

of  constraints  with  indices  in  v2(x)  . The  multlplier.s  for  these  constraints 
are  given  bv  the  fonnula  (<il)  below.  Then  the  exact  penalty  function  takes 
the  form 

f(x)  - ! '(x)g'(x)%(x)  + -|  q )f(x)'^^’(x)^g’(x)‘‘^;^(x)  . (40) 

because 

)^(x,q)  = - 1^'M  g'(x)^'^f ’(x)^  (41) 

and 

5(x.q)  = (x)j  f ' (x)^/q  - (x) . 

The  obvious  connection  between  this  one  and  his  equality  exact  penalty 
function  can  be  made.  This  derivation  is  different  from  the  way  in  which  (20) 
was  constructed.  The  difference  between  this  one  and  the  idealized  exact  penalty 
function  (31)  can  be  atailyzed  in  the  same  way  that  the  equality  penalty  func- 
t ions  were . 

It  is  interesting  to  show  how  Fletcher  could  have  obtained  a similar 
penalty  function  which  is  closer  but  still  essentially  different  from  the 
minimum  distance  point  of  view. 

Consider  the  problem 

1 T 

minimize  f(x-6)  + iSq 

4 (42) 

subject  to  r(x-(S)  > 0 . 


- 
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Kor  anv  x . le*  6(x,q)  denote’  a solution  point  (local),  and 
u(x,q)  .1  set  of  associated  multipliers. 

Tlie  exact  penalty  function  is  defined  at  x by 

C(x)  » f[x-6(x,q)]  + Y6(x,q)'^6(x,q)q  . (43) 

Tlie  questions  of  whether  or  not  this  has  spurious  local  minimizers  and 
wiietlier  or  not  ininimizer  of  ttie  original  problem  are  minimizers  of  this  one 
are  taken  up  next. 

Thc^oreni_  1J_.  If  f , are  continuous,  and  if  x is  local  minimizer  for 

(2h),  then  x is  a local  unconstrained  minimizer  for  the  exact  penalty  function 
(43)  above  for  an\  q > 0 . 


P^r_oof;  i’ick  c small  enough  so  that  x 
xt.  I X I I I X - X ] I <_  2/2e/q  | P R . Let 
Consider  the  neighborhood  about  0 such 
Then  for  6^^  e 3ND(D)  , 7 M II  ^9  = y 

^p.llx-aj^-xll^llx-xll  + l] 


is  a global  minimizer  for 


X e 

that 

2c 

q 

“b  I 


"1  ■{’ 

“■{ 


6 

q = G 


I X - X II  < /2c/q  I . 

I I 6 1 1 < /2^|  . 

Furthermore,  for 


< 2 /Ze/q  . Thus 


f [x-6p]  ^ f (x)  . 


Together,  then, 

*■  [x-6pl  + y 1 I 5j^|  I ^q  ^ f (x)  + c . 

But  for  (5  = (x-x)  , 

f [x-(x-x)l  + ^1'  <5  irq“f(x)+yt|  6 ll^q 

I 

< f (x)  + t . 

Therefore  tlie  inf  im'jn  is  taken  on  at  an  interior  point. 


- ?f,  - 


A 
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The  converse  th.-orem  will  now  be  proved . 

f '"jre  continuous  functions,  then  any  local  unconstrained 

minimizer  for  the  exact  penalty  function  (43)  for  any  value  of  q is  a local 
mininizer  for  the  constrained  Problem  (26). 

I^rooJ : Let  x be  i local  unconstrained  minimizer  for  (43).  We  first  show 

tliat 

6(x,q)  ' 0 . (44) 

Assume  that  6(x,q)  i 0 . Consider  points  along  the  ray  connecting 
X to  x - 6(x,q)  . The  minimum  value  of  (42)  is  clearly  less  than 

f(x-4(x,q)]  + ^-q(l-e)^  i|  6(x,q)  |1^  < f(x-6(x,q)l  + -|q  ||  6(x,q)  1I‘ 

for  L small.  This  contradicts  the  assumption.  Thus  (44)  must  hold. 

I.et  X be  any  point  feasible  to  (26),  and  "close  to"  x . Then 

f (x)  > C(x)  (since  0 is  feasible  to  (42)  when  x r.  R) 

> C.(x)  (assumption  that  x v>’as  a local  unconstrained 

mininizer) 

= f(x)  (6(x,q)  » 0)  . Q.F..D. 
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